Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 3677 |
| Missing cells | 9171 |
| Missing cells (%) | 10.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 MiB |
| Average record size in memory | 631.8 B |
Variable types
| Text | 3 |
|---|---|
| Categorical | 10 |
| Numeric | 10 |
store room is highly imbalanced (55.7%) | Imbalance |
floorNum has 2431 (66.1%) missing values | Missing |
facing has 1045 (28.4%) missing values | Missing |
super_built_up_area has 1802 (49.0%) missing values | Missing |
built_up_area has 2036 (55.4%) missing values | Missing |
carpet_area has 1805 (49.1%) missing values | Missing |
area is highly skewed (γ1 = 29.73095338) | Skewed |
built_up_area is highly skewed (γ1 = 40.14464398) | Skewed |
carpet_area is highly skewed (γ1 = 24.33323909) | Skewed |
luxury_score has 481 (13.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-13 04:16:15.312464 |
|---|---|
| Analysis finished | 2024-02-13 04:16:24.723817 |
| Duration | 9.41 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
society
Text
| Distinct | 675 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 293.9 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 16.863166 |
| Min length | 1 |
Characters and Unicode
| Total characters | 61989 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 307 ? |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | the lions cghs |
|---|---|
| 2nd row | bestech park view residency |
| 3rd row | bptp freedom park life |
| 4th row | ss the leaf |
| 5th row | vatika city homes |
| Value | Count | Frequency (%) |
| independent | 491 | 5.1% |
| the | 350 | 3.6% |
| dlf | 220 | 2.3% |
| park | 209 | 2.2% |
| city | 166 | 1.7% |
| emaar | 155 | 1.6% |
| global | 153 | 1.6% |
| m3m | 152 | 1.6% |
| signature | 150 | 1.6% |
| heights | 134 | 1.4% |
| Other values (781) | 7497 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6701 | 10.8% |
| 6003 | 9.7% | |
| a | 5864 | 9.5% |
| r | 4171 | 6.7% |
| n | 4160 | 6.7% |
| i | 3827 | 6.2% |
| t | 3716 | 6.0% |
| s | 3472 | 5.6% |
| l | 2943 | 4.7% |
| o | 2755 | 4.4% |
| Other values (31) | 18377 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 55441 | |
| Space Separator | 6003 | 9.7% |
| Decimal Number | 527 | 0.9% |
| Other Punctuation | 10 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6701 | |
| a | 5864 | 10.6% |
| r | 4171 | 7.5% |
| n | 4160 | 7.5% |
| i | 3827 | 6.9% |
| t | 3716 | 6.7% |
| s | 3472 | 6.3% |
| l | 2943 | 5.3% |
| o | 2755 | 5.0% |
| d | 2482 | 4.5% |
| Other values (16) | 15350 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 207 | |
| 2 | 82 | 15.6% |
| 1 | 75 | 14.2% |
| 6 | 56 | 10.6% |
| 8 | 32 | 6.1% |
| 4 | 19 | 3.6% |
| 5 | 17 | 3.2% |
| 0 | 13 | 2.5% |
| 9 | 13 | 2.5% |
| 7 | 13 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| / | 2 | 20.0% |
| . | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 6003 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55441 | |
| Common | 6548 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6701 | |
| a | 5864 | 10.6% |
| r | 4171 | 7.5% |
| n | 4160 | 7.5% |
| i | 3827 | 6.9% |
| t | 3716 | 6.7% |
| s | 3472 | 6.3% |
| l | 2943 | 5.3% |
| o | 2755 | 5.0% |
| d | 2482 | 4.5% |
| Other values (16) | 15350 |
Common
| Value | Count | Frequency (%) |
| 6003 | ||
| 3 | 207 | 3.2% |
| 2 | 82 | 1.3% |
| 1 | 75 | 1.1% |
| 6 | 56 | 0.9% |
| 8 | 32 | 0.5% |
| 4 | 19 | 0.3% |
| 5 | 17 | 0.3% |
| 0 | 13 | 0.2% |
| 9 | 13 | 0.2% |
| Other values (5) | 31 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61989 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6701 | 10.8% |
| 6003 | 9.7% | |
| a | 5864 | 9.5% |
| r | 4171 | 6.7% |
| n | 4160 | 6.7% |
| i | 3827 | 6.2% |
| t | 3716 | 6.0% |
| s | 3472 | 5.6% |
| l | 2943 | 4.7% |
| o | 2755 | 4.4% |
| Other values (31) | 18377 |
property_type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 248.6 KiB |
| flat | |
|---|---|
| house |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.2336144 |
| Min length | 4 |
Characters and Unicode
| Total characters | 15567 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | flat |
|---|---|
| 2nd row | flat |
| 3rd row | flat |
| 4th row | flat |
| 5th row | flat |
Common Values
| Value | Count | Frequency (%) |
| flat | 2818 | |
| house | 859 | 23.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| flat | 2818 | |
| house | 859 | 23.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15567 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15567 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15567 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
sector
Text
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 266.9 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 9 |
| Mean length | 9.3209138 |
| Min length | 7 |
Characters and Unicode
| Total characters | 34273 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | sector 56 |
|---|---|
| 2nd row | sector 2 |
| 3rd row | sector 57 |
| 4th row | sector 85 |
| 5th row | sector 83 |
| Value | Count | Frequency (%) |
| sector | 3452 | |
| road | 178 | 2.4% |
| sohna | 166 | 2.2% |
| 85 | 108 | 1.5% |
| 102 | 107 | 1.4% |
| 92 | 100 | 1.4% |
| 69 | 93 | 1.3% |
| 90 | 89 | 1.2% |
| 81 | 87 | 1.2% |
| 65 | 87 | 1.2% |
| Other values (106) | 2915 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3807 | |
| 3705 | ||
| s | 3697 | |
| r | 3697 | |
| e | 3542 | |
| c | 3503 | |
| t | 3463 | |
| 1 | 1076 | 3.1% |
| 0 | 804 | 2.3% |
| 8 | 780 | 2.3% |
| Other values (21) | 6199 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23299 | |
| Decimal Number | 7269 | 21.2% |
| Space Separator | 3705 | 10.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3807 | |
| s | 3697 | |
| r | 3697 | |
| e | 3542 | |
| c | 3503 | |
| t | 3463 | |
| a | 699 | 3.0% |
| d | 249 | 1.1% |
| n | 221 | 0.9% |
| h | 203 | 0.9% |
| Other values (10) | 218 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1076 | |
| 0 | 804 | |
| 8 | 780 | |
| 9 | 764 | |
| 6 | 742 | |
| 7 | 684 | |
| 2 | 676 | |
| 3 | 666 | |
| 5 | 593 | |
| 4 | 484 |
Space Separator
| Value | Count | Frequency (%) |
| 3705 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23299 | |
| Common | 10974 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3807 | |
| s | 3697 | |
| r | 3697 | |
| e | 3542 | |
| c | 3503 | |
| t | 3463 | |
| a | 699 | 3.0% |
| d | 249 | 1.1% |
| n | 221 | 0.9% |
| h | 203 | 0.9% |
| Other values (10) | 218 | 0.9% |
Common
| Value | Count | Frequency (%) |
| 3705 | ||
| 1 | 1076 | 9.8% |
| 0 | 804 | 7.3% |
| 8 | 780 | 7.1% |
| 9 | 764 | 7.0% |
| 6 | 742 | 6.8% |
| 7 | 684 | 6.2% |
| 2 | 676 | 6.2% |
| 3 | 666 | 6.1% |
| 5 | 593 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34273 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3807 | |
| 3705 | ||
| s | 3697 | |
| r | 3697 | |
| e | 3542 | |
| c | 3503 | |
| t | 3463 | |
| 1 | 1076 | 3.1% |
| 0 | 804 | 2.3% |
| 8 | 780 | 2.3% |
| Other values (21) | 6199 |
price
Real number (ℝ)
| Distinct | 473 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5336639 |
| Minimum | 0.07 |
|---|---|
| Maximum | 31.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0.07 |
|---|---|
| 5-th percentile | 0.37 |
| Q1 | 0.95 |
| median | 1.52 |
| Q3 | 2.75 |
| 95-th percentile | 8.5 |
| Maximum | 31.5 |
| Range | 31.43 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 2.9806235 |
|---|---|
| Coefficient of variation (CV) | 1.1764084 |
| Kurtosis | 14.933373 |
| Mean | 2.5336639 |
| Median Absolute Deviation (MAD) | 0.72 |
| Skewness | 3.2791705 |
| Sum | 9273.21 |
| Variance | 8.8841164 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.25 | 80 | 2.2% |
| 1.2 | 64 | 1.7% |
| 1.5 | 64 | 1.7% |
| 0.9 | 63 | 1.7% |
| 1.1 | 62 | 1.7% |
| 1.4 | 60 | 1.6% |
| 1.3 | 57 | 1.6% |
| 2 | 52 | 1.4% |
| 0.95 | 52 | 1.4% |
| 1.6 | 48 | 1.3% |
| Other values (463) | 3058 |
| Value | Count | Frequency (%) |
| 0.07 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.19 | 1 | < 0.1% |
| 0.2 | 8 | |
| 0.21 | 6 | |
| 0.22 | 8 | |
| 0.23 | 1 | < 0.1% |
| 0.24 | 6 | |
| 0.25 | 11 |
| Value | Count | Frequency (%) |
| 31.5 | 1 | < 0.1% |
| 27.5 | 1 | < 0.1% |
| 26 | 2 | |
| 25 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 3 | |
| 19.5 | 2 | |
| 19 | 3 |
price_per_sqft
Real number (ℝ)
| Distinct | 2651 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13892.668 |
| Minimum | 4 |
|---|---|
| Maximum | 600000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4715.95 |
| Q1 | 6817.25 |
| median | 9020 |
| Q3 | 13880.5 |
| 95-th percentile | 33333 |
| Maximum | 600000 |
| Range | 599996 |
| Interquartile range (IQR) | 7063.25 |
Descriptive statistics
| Standard deviation | 23210.067 |
|---|---|
| Coefficient of variation (CV) | 1.6706702 |
| Kurtosis | 186.92801 |
| Mean | 13892.668 |
| Median Absolute Deviation (MAD) | 2794 |
| Skewness | 11.43719 |
| Sum | 50847166 |
| Variance | 5.3870722 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 27 | 0.7% |
| 8000 | 19 | 0.5% |
| 5000 | 17 | 0.5% |
| 12500 | 14 | 0.4% |
| 11111 | 13 | 0.4% |
| 6666 | 13 | 0.4% |
| 22222 | 13 | 0.4% |
| 7500 | 12 | 0.3% |
| 8333 | 12 | 0.3% |
| 6000 | 11 | 0.3% |
| Other values (2641) | 3509 | |
| (Missing) | 17 | 0.5% |
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 53 | 1 | |
| 57 | 1 | |
| 58 | 2 | |
| 60 | 1 | |
| 61 | 1 | |
| 79 | 1 |
| Value | Count | Frequency (%) |
| 600000 | 1 | |
| 400000 | 1 | |
| 315789 | 1 | |
| 308333 | 1 | |
| 290948 | 1 | |
| 283333 | 1 | |
| 266666 | 1 | |
| 261194 | 1 | |
| 245398 | 1 | |
| 241666 | 1 |
area
Real number (ℝ)
SKEWED 
| Distinct | 2762 |
|---|---|
| Distinct (%) | 75.5% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2888.3887 |
| Minimum | 50 |
|---|---|
| Maximum | 875000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 518.8855 |
| Q1 | 1231.97 |
| median | 1733.015 |
| Q3 | 2300.095 |
| 95-th percentile | 4245.8975 |
| Maximum | 875000 |
| Range | 874950 |
| Interquartile range (IQR) | 1068.125 |
Descriptive statistics
| Standard deviation | 23167.504 |
|---|---|
| Coefficient of variation (CV) | 8.0209095 |
| Kurtosis | 942.02894 |
| Mean | 2888.3887 |
| Median Absolute Deviation (MAD) | 532.945 |
| Skewness | 29.730953 |
| Sum | 10571503 |
| Variance | 5.3673325 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2000 | 33 | 0.9% |
| 1000 | 18 | 0.5% |
| 1650.17 | 17 | 0.5% |
| 1250 | 13 | 0.4% |
| 500 | 11 | 0.3% |
| 900.09 | 10 | 0.3% |
| 1650.05 | 10 | 0.3% |
| 900 | 9 | 0.2% |
| 1350.01 | 9 | 0.2% |
| 1800.01 | 9 | 0.2% |
| Other values (2752) | 3521 | |
| (Missing) | 17 | 0.5% |
| Value | Count | Frequency (%) |
| 50 | 4 | |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 2 | |
| 61 | 1 | < 0.1% |
| 67 | 2 | |
| 70 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 875000 | 1 | |
| 642857.14 | 1 | |
| 620000 | 1 | |
| 566666.67 | 1 | |
| 215517.24 | 1 | |
| 98977.95 | 1 | |
| 82781.46 | 1 | |
| 65517.24 | 2 | |
| 65261.04 | 1 | |
| 58227.85 | 1 |
areaWithType
Text
| Distinct | 2355 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 428.2 KiB |
Length
| Max length | 124 |
|---|---|
| Median length | 119 |
| Mean length | 54.236062 |
| Min length | 12 |
Characters and Unicode
| Total characters | 199426 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1849 ? |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Super Built up area 2400(222.97 sq.m.)Built Up area: 2000 sq.ft. (185.81 sq.m.)Carpet area: 1800 sq.ft. (167.23 sq.m.) |
|---|---|
| 2nd row | Super Built up area 1565(145.39 sq.m.) |
| 3rd row | Built Up area: 5010 (465.44 sq.m.) |
| 4th row | Super Built up area 1741(161.74 sq.m.)Built Up area: 1730 sq.ft. (160.72 sq.m.)Carpet area: 1720 sq.ft. (159.79 sq.m.) |
| 5th row | Super Built up area 1740(161.65 sq.m.)Carpet area: 1225 sq.ft. (113.81 sq.m.) |
| Value | Count | Frequency (%) |
| area | 5573 | |
| sq.m | 3655 | |
| up | 3020 | 10.0% |
| built | 2316 | 7.7% |
| super | 1875 | 6.2% |
| sq.ft | 1751 | 5.8% |
| sq.m.)carpet | 1185 | 3.9% |
| sq.m.)built | 702 | 2.3% |
| carpet | 683 | 2.3% |
| plot | 681 | 2.3% |
| Other values (2846) | 8700 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26464 | 13.3% | |
| . | 20389 | 10.2% |
| a | 13154 | 6.6% |
| r | 9456 | 4.7% |
| e | 9320 | 4.7% |
| 1 | 9205 | 4.6% |
| s | 7567 | 3.8% |
| q | 7431 | 3.7% |
| t | 7324 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82346 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82758 | |
| Decimal Number | 47135 | |
| Space Separator | 26464 | 13.3% |
| Other Punctuation | 23406 | 11.7% |
| Uppercase Letter | 8593 | 4.3% |
| Close Punctuation | 5535 | 2.8% |
| Open Punctuation | 5535 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13154 | |
| r | 9456 | |
| e | 9320 | |
| s | 7567 | |
| q | 7431 | |
| t | 7324 | |
| u | 6770 | |
| p | 6767 | |
| m | 5544 | |
| l | 3701 | 4.5% |
| Other values (5) | 5724 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9205 | |
| 0 | 6628 | |
| 2 | 5688 | |
| 5 | 4714 | |
| 3 | 3960 | |
| 4 | 3711 | |
| 6 | 3674 | 7.8% |
| 7 | 3254 | 6.9% |
| 8 | 3157 | 6.7% |
| 9 | 3144 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3020 | |
| S | 1875 | |
| C | 1872 | |
| U | 1145 | 13.3% |
| P | 681 | 7.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20389 | |
| : | 3017 | 12.9% |
Space Separator
| Value | Count | Frequency (%) |
| 26464 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5535 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5535 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 108075 | |
| Latin | 91351 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13154 | |
| r | 9456 | |
| e | 9320 | |
| s | 7567 | |
| q | 7431 | |
| t | 7324 | |
| u | 6770 | |
| p | 6767 | |
| m | 5544 | 6.1% |
| l | 3701 | 4.1% |
| Other values (10) | 14317 |
Common
| Value | Count | Frequency (%) |
| 26464 | ||
| . | 20389 | |
| 1 | 9205 | 8.5% |
| 0 | 6628 | 6.1% |
| 2 | 5688 | 5.3% |
| ) | 5535 | 5.1% |
| ( | 5535 | 5.1% |
| 5 | 4714 | 4.4% |
| 3 | 3960 | 3.7% |
| 4 | 3711 | 3.4% |
| Other values (5) | 16246 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199426 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 26464 | 13.3% | |
| . | 20389 | 10.2% |
| a | 13154 | 6.6% |
| r | 9456 | 4.7% |
| e | 9320 | 4.7% |
| 1 | 9205 | 4.6% |
| s | 7567 | 3.8% |
| q | 7431 | 3.7% |
| t | 7324 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82346 |
bedRoom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3600761 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8976289 |
|---|---|
| Coefficient of variation (CV) | 0.56475771 |
| Kurtosis | 18.212873 |
| Mean | 3.3600761 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.4851418 |
| Sum | 12355 |
| Variance | 3.6009954 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1496 | |
| 2 | 942 | |
| 4 | 660 | |
| 5 | 210 | 5.7% |
| 1 | 124 | 3.4% |
| 6 | 74 | 2.0% |
| 9 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 12 | 28 | 0.8% |
| 7 | 28 | 0.8% |
| Other values (9) | 44 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 124 | 3.4% |
| 2 | 942 | |
| 3 | 1496 | |
| 4 | 660 | |
| 5 | 210 | 5.7% |
| 6 | 74 | 2.0% |
| 7 | 28 | 0.8% |
| 8 | 30 | 0.8% |
| 9 | 41 | 1.1% |
| 10 | 20 | 0.5% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | 0.1% |
| 18 | 2 | 0.1% |
| 16 | 12 | |
| 14 | 1 | < 0.1% |
| 13 | 4 | 0.1% |
| 12 | 28 | |
| 11 | 1 | < 0.1% |
| 10 | 20 |
bathroom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4245309 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9480681 |
|---|---|
| Coefficient of variation (CV) | 0.56885693 |
| Kurtosis | 17.542297 |
| Mean | 3.4245309 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.2488298 |
| Sum | 12592 |
| Variance | 3.7949693 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1077 | |
| 2 | 1047 | |
| 4 | 820 | |
| 5 | 294 | 8.0% |
| 1 | 156 | 4.2% |
| 6 | 117 | 3.2% |
| 9 | 41 | 1.1% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 12 | 22 | 0.6% |
| Other values (9) | 38 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 156 | 4.2% |
| 2 | 1047 | |
| 3 | 1077 | |
| 4 | 820 | |
| 5 | 294 | 8.0% |
| 6 | 117 | 3.2% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 9 | 41 | 1.1% |
| 10 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 3 | 0.1% |
| 18 | 4 | 0.1% |
| 17 | 3 | 0.1% |
| 16 | 8 | 0.2% |
| 14 | 2 | 0.1% |
| 13 | 4 | 0.1% |
| 12 | 22 | |
| 11 | 4 | 0.1% |
| 10 | 9 |
balcony
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 238.2 KiB |
| 3+ | |
|---|---|
| 3 | |
| 2 | |
| 1 | |
| 0 | 96 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.3421267 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4935 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3+ |
|---|---|
| 2nd row | 3 |
| 3rd row | 3+ |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 1172 | |
| 3 | 1074 | |
| 2 | 884 | |
| 1 | 365 | 9.9% |
| 0 | 96 | 2.6% |
| No | 86 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| 2 | 884 | 24.0% |
| 1 | 365 | 9.9% |
| 0 | 96 | 2.6% |
| no | 86 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 884 | 17.9% |
| 1 | 365 | 7.4% |
| 0 | 96 | 1.9% |
| N | 86 | 1.7% |
| o | 86 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3591 | |
| Math Symbol | 1172 | 23.7% |
| Uppercase Letter | 86 | 1.7% |
| Lowercase Letter | 86 | 1.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| 2 | 884 | 24.6% |
| 1 | 365 | 10.2% |
| 0 | 96 | 2.7% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1172 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 86 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 86 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4763 | |
| Latin | 172 | 3.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 884 | 18.6% |
| 1 | 365 | 7.7% |
| 0 | 96 | 2.0% |
Latin
| Value | Count | Frequency (%) |
| N | 86 | |
| o | 86 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4935 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 884 | 17.9% |
| 1 | 365 | 7.4% |
| 0 | 96 | 1.9% |
| N | 86 | 1.7% |
| o | 86 | 1.7% |
floorNum
Real number (ℝ)
MISSING 
| Distinct | 20 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 2431 |
| Missing (%) | 66.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0401284 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 12 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 4.4260994 |
|---|---|
| Coefficient of variation (CV) | 1.0955343 |
| Kurtosis | 16.524098 |
| Mean | 4.0401284 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.2024875 |
| Sum | 5034 |
| Variance | 19.590356 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 493 | 13.4% |
| 3 | 273 | 7.4% |
| 12 | 158 | 4.3% |
| 1 | 152 | 4.1% |
| 4 | 127 | 3.5% |
| 22 | 13 | 0.4% |
| 5 | 8 | 0.2% |
| 14 | 3 | 0.1% |
| 0 | 3 | 0.1% |
| 6 | 3 | 0.1% |
| Other values (10) | 13 | 0.4% |
| (Missing) | 2431 |
| Value | Count | Frequency (%) |
| 0 | 3 | 0.1% |
| 1 | 152 | 4.1% |
| 2 | 493 | |
| 3 | 273 | |
| 4 | 127 | 3.5% |
| 5 | 8 | 0.2% |
| 6 | 3 | 0.1% |
| 10 | 2 | 0.1% |
| 11 | 2 | 0.1% |
| 12 | 158 | 4.3% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 2 | 0.1% |
| 27 | 1 | < 0.1% |
| 22 | 13 | |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 14 | 3 | 0.1% |
| 13 | 1 | < 0.1% |
facing
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1045 |
| Missing (%) | 28.4% |
| Memory size | 250.0 KiB |
| East | |
|---|---|
| North-East | |
| North | |
| West | |
| South | |
| Other values (3) |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.8381459 |
| Min length | 4 |
Characters and Unicode
| Total characters | 17998 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | East |
|---|---|
| 2nd row | South-West |
| 3rd row | East |
| 4th row | South-East |
| 5th row | South-East |
Common Values
| Value | Count | Frequency (%) |
| East | 623 | |
| North-East | 623 | |
| North | 387 | 10.5% |
| West | 249 | 6.8% |
| South | 231 | 6.3% |
| North-West | 193 | 5.2% |
| South-East | 173 | 4.7% |
| South-West | 153 | 4.2% |
| (Missing) | 1045 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| east | 623 | |
| north-east | 623 | |
| north | 387 | |
| west | 249 | 9.5% |
| south | 231 | 8.8% |
| north-west | 193 | 7.3% |
| south-east | 173 | 6.6% |
| south-west | 153 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 7.9% |
| a | 1419 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13082 | |
| Uppercase Letter | 3774 | 21.0% |
| Dash Punctuation | 1142 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| a | 1419 | 10.8% |
| r | 1203 | 9.2% |
| e | 595 | 4.5% |
| u | 557 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1419 | |
| N | 1203 | |
| W | 595 | |
| S | 557 | 14.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16856 | |
| Common | 1142 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 8.4% |
| a | 1419 | 8.4% |
| N | 1203 | 7.1% |
| r | 1203 | 7.1% |
| W | 595 | 3.5% |
| e | 595 | 3.5% |
| Other values (2) | 1114 | 6.6% |
Common
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17998 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 7.9% |
| a | 1419 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
agePossession
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 280.3 KiB |
| relatively new | |
|---|---|
| new property | |
| moderately new | |
| Undefined | |
| old property | |
| Other values (2) | 135 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 13.062823 |
| Min length | 9 |
Characters and Unicode
| Total characters | 48032 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | old property |
|---|---|
| 2nd row | moderately new |
| 3rd row | moderately new |
| 4th row | relatively new |
| 5th row | moderately new |
Common Values
| Value | Count | Frequency (%) |
| relatively new | 1646 | |
| new property | 593 | 16.1% |
| moderately new | 563 | 15.3% |
| Undefined | 437 | 11.9% |
| old property | 303 | 8.2% |
| under construction | 134 | 3.6% |
| undefined | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| new | 2802 | |
| relatively | 1646 | |
| property | 896 | 13.0% |
| moderately | 563 | 8.1% |
| undefined | 438 | 6.3% |
| old | 303 | 4.4% |
| under | 134 | 1.9% |
| construction | 134 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9126 | |
| r | 4269 | |
| l | 4158 | |
| n | 4080 | |
| t | 3373 | 7.0% |
| 3239 | 6.7% | |
| y | 3105 | 6.5% |
| w | 2802 | 5.8% |
| i | 2218 | 4.6% |
| a | 2209 | 4.6% |
| Other values (10) | 9453 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44356 | |
| Space Separator | 3239 | 6.7% |
| Uppercase Letter | 437 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9126 | |
| r | 4269 | |
| l | 4158 | |
| n | 4080 | |
| t | 3373 | 7.6% |
| y | 3105 | 7.0% |
| w | 2802 | 6.3% |
| i | 2218 | 5.0% |
| a | 2209 | 5.0% |
| o | 2030 | 4.6% |
| Other values (8) | 6986 |
Space Separator
| Value | Count | Frequency (%) |
| 3239 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44793 | |
| Common | 3239 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9126 | |
| r | 4269 | |
| l | 4158 | |
| n | 4080 | |
| t | 3373 | 7.5% |
| y | 3105 | 6.9% |
| w | 2802 | 6.3% |
| i | 2218 | 5.0% |
| a | 2209 | 4.9% |
| o | 2030 | 4.5% |
| Other values (9) | 7423 |
Common
| Value | Count | Frequency (%) |
| 3239 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9126 | |
| r | 4269 | |
| l | 4158 | |
| n | 4080 | |
| t | 3373 | 7.0% |
| 3239 | 6.7% | |
| y | 3105 | 6.5% |
| w | 2802 | 5.8% |
| i | 2218 | 4.6% |
| a | 2209 | 4.6% |
| Other values (10) | 9453 |
super_built_up_area
Real number (ℝ)
MISSING 
| Distinct | 593 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 1802 |
| Missing (%) | 49.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1925.2376 |
| Minimum | 89 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 89 |
|---|---|
| 5-th percentile | 767 |
| Q1 | 1479.5 |
| median | 1828 |
| Q3 | 2215 |
| 95-th percentile | 3185 |
| Maximum | 10000 |
| Range | 9911 |
| Interquartile range (IQR) | 735.5 |
Descriptive statistics
| Standard deviation | 764.17218 |
|---|---|
| Coefficient of variation (CV) | 0.39692356 |
| Kurtosis | 10.349191 |
| Mean | 1925.2376 |
| Median Absolute Deviation (MAD) | 372 |
| Skewness | 1.8364563 |
| Sum | 3609820.5 |
| Variance | 583959.12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1650 | 37 | 1.0% |
| 1950 | 37 | 1.0% |
| 2000 | 25 | 0.7% |
| 1578 | 25 | 0.7% |
| 1640 | 22 | 0.6% |
| 2150 | 22 | 0.6% |
| 1900 | 19 | 0.5% |
| 2408 | 19 | 0.5% |
| 1930 | 18 | 0.5% |
| 2812 | 17 | 0.5% |
| Other values (583) | 1634 | |
| (Missing) | 1802 |
| Value | Count | Frequency (%) |
| 89 | 1 | |
| 145 | 1 | |
| 161 | 1 | |
| 215 | 1 | |
| 216 | 1 | |
| 325 | 1 | |
| 340 | 1 | |
| 352 | 1 | |
| 380 | 1 | |
| 406 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 6926 | 1 | |
| 6000 | 1 | |
| 5800 | 2 | |
| 5514 | 1 | |
| 5350 | 2 | |
| 5200 | 2 | |
| 4890 | 1 | |
| 4857 | 1 | |
| 4848 | 2 |
built_up_area
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 626 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 2036 |
| Missing (%) | 55.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2394.3309 |
| Minimum | 30 |
|---|---|
| Maximum | 737147 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 267 |
| Q1 | 1125 |
| median | 1650 |
| Q3 | 2400 |
| 95-th percentile | 4680 |
| Maximum | 737147 |
| Range | 737117 |
| Interquartile range (IQR) | 1275 |
Descriptive statistics
| Standard deviation | 18203.807 |
|---|---|
| Coefficient of variation (CV) | 7.6028787 |
| Kurtosis | 1621.2744 |
| Mean | 2394.3309 |
| Median Absolute Deviation (MAD) | 600 |
| Skewness | 40.144644 |
| Sum | 3929097 |
| Variance | 3.3137861 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 41 | 1.1% |
| 3240 | 36 | 1.0% |
| 1900 | 34 | 0.9% |
| 2700 | 33 | 0.9% |
| 1350 | 33 | 0.9% |
| 900 | 27 | 0.7% |
| 1600 | 26 | 0.7% |
| 1300 | 24 | 0.7% |
| 2000 | 24 | 0.7% |
| 1700 | 23 | 0.6% |
| Other values (616) | 1340 | |
| (Missing) | 2036 |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 50 | 3 | |
| 53 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 4 | |
| 61 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 67 | 2 |
| Value | Count | Frequency (%) |
| 737147 | 1 | < 0.1% |
| 11286 | 1 | < 0.1% |
| 9500 | 1 | < 0.1% |
| 9000 | 7 | |
| 8775 | 1 | < 0.1% |
| 8286 | 1 | < 0.1% |
| 8000 | 1 | < 0.1% |
| 7500 | 2 | 0.1% |
| 7450 | 1 | < 0.1% |
| 7331 | 2 | 0.1% |
carpet_area
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 733 |
|---|---|
| Distinct (%) | 39.2% |
| Missing | 1805 |
| Missing (%) | 49.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2529.1795 |
| Minimum | 15 |
|---|---|
| Maximum | 607936 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 350 |
| Q1 | 843 |
| median | 1300 |
| Q3 | 1790 |
| 95-th percentile | 2950 |
| Maximum | 607936 |
| Range | 607921 |
| Interquartile range (IQR) | 947 |
Descriptive statistics
| Standard deviation | 22799.836 |
|---|---|
| Coefficient of variation (CV) | 9.0147166 |
| Kurtosis | 604.53764 |
| Mean | 2529.1795 |
| Median Absolute Deviation (MAD) | 472.5 |
| Skewness | 24.333239 |
| Sum | 4734624 |
| Variance | 5.1983254 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1400 | 42 | 1.1% |
| 1800 | 35 | 1.0% |
| 1600 | 35 | 1.0% |
| 1200 | 31 | 0.8% |
| 1500 | 29 | 0.8% |
| 1650 | 28 | 0.8% |
| 1350 | 27 | 0.7% |
| 1300 | 23 | 0.6% |
| 1450 | 22 | 0.6% |
| 1000 | 22 | 0.6% |
| Other values (723) | 1578 | |
| (Missing) | 1805 |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76.44 | 3 | |
| 77.31 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 607936 | 1 | |
| 569243 | 1 | |
| 514396 | 1 | |
| 64529 | 1 | |
| 64412 | 1 | |
| 58141 | 1 | |
| 54917 | 1 | |
| 48811 | 1 | |
| 45966 | 1 | |
| 34401 | 1 |
servant room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2349 | |
| 1 | 1328 |
pooja room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3021 | |
| 1 | 656 | 17.8% |
store room
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3339 | |
| 1 | 338 | 9.2% |
study room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2972 | |
| 1 | 705 | 19.2% |
others
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3272 | |
| 1 | 405 | 11.0% |
furniture_labels
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.0 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 203 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3677 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3677 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3677 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2436 | |
| 2 | 1038 | |
| 1 | 203 | 5.5% |
luxury_score
Real number (ℝ)
ZEROS 
| Distinct | 116 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.621431 |
| Minimum | 0 |
|---|---|
| Maximum | 136 |
| Zeros | 481 |
| Zeros (%) | 13.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 23 |
| median | 45 |
| Q3 | 84 |
| 95-th percentile | 136 |
| Maximum | 136 |
| Range | 136 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 41.706065 |
|---|---|
| Coefficient of variation (CV) | 0.74982006 |
| Kurtosis | -0.87842812 |
| Mean | 55.621431 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.47593511 |
| Sum | 204520 |
| Variance | 1739.3958 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 481 | 13.1% |
| 35 | 418 | 11.4% |
| 136 | 215 | 5.8% |
| 28 | 99 | 2.7% |
| 21 | 83 | 2.3% |
| 52 | 79 | 2.1% |
| 15 | 73 | 2.0% |
| 45 | 64 | 1.7% |
| 30 | 63 | 1.7% |
| 127 | 60 | 1.6% |
| Other values (106) | 2042 |
| Value | Count | Frequency (%) |
| 0 | 481 | |
| 6 | 7 | 0.2% |
| 7 | 52 | 1.4% |
| 8 | 49 | 1.3% |
| 9 | 1 | < 0.1% |
| 13 | 14 | 0.4% |
| 14 | 34 | 0.9% |
| 15 | 73 | 2.0% |
| 16 | 26 | 0.7% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 136 | 215 | |
| 130 | 20 | 0.5% |
| 129 | 43 | 1.2% |
| 128 | 17 | 0.5% |
| 127 | 60 | 1.6% |
| 123 | 10 | 0.3% |
| 122 | 35 | 1.0% |
| 121 | 24 | 0.7% |
| 120 | 35 | 1.0% |
| 119 | 18 | 0.5% |
| society | property_type | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | servant room | pooja room | store room | study room | others | furniture_labels | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | the lions cghs | flat | sector 56 | 1.99 | 11055.0 | 1800.09 | Super Built up area 2400(222.97 sq.m.)Built Up area: 2000 sq.ft. (185.81 sq.m.)Carpet area: 1800 sq.ft. (167.23 sq.m.) | 4 | 4 | 3+ | NaN | East | old property | 2400.0 | 2000.0 | 1800.0 | 0 | 1 | 0 | 0 | 1 | 2 | 92 |
| 1 | bestech park view residency | flat | sector 2 | 0.98 | 6261.0 | 1565.25 | Super Built up area 1565(145.39 sq.m.) | 2 | 2 | 3 | NaN | South-West | moderately new | 1565.0 | NaN | NaN | 0 | 0 | 0 | 1 | 0 | 0 | 75 |
| 2 | bptp freedom park life | flat | sector 57 | 5.50 | 8982.0 | 6123.36 | Built Up area: 5010 (465.44 sq.m.) | 5 | 6 | 3+ | NaN | East | moderately new | NaN | 5010.0 | NaN | 1 | 1 | 0 | 0 | 0 | 2 | 120 |
| 3 | ss the leaf | flat | sector 85 | 1.25 | 7179.0 | 1741.19 | Super Built up area 1741(161.74 sq.m.)Built Up area: 1730 sq.ft. (160.72 sq.m.)Carpet area: 1720 sq.ft. (159.79 sq.m.) | 2 | 2 | 3 | NaN | South-East | relatively new | 1741.0 | 1730.0 | 1720.0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| 4 | vatika city homes | flat | sector 83 | 1.05 | 8571.0 | 1225.06 | Super Built up area 1740(161.65 sq.m.)Carpet area: 1225 sq.ft. (113.81 sq.m.) | 3 | 3 | 3 | NaN | South-East | moderately new | 1740.0 | NaN | 1225.0 | 0 | 1 | 0 | 0 | 0 | 2 | 81 |
| 5 | aipl the peaceful homes | flat | sector 70a | 2.80 | 11914.0 | 2350.18 | Super Built up area 2350(218.32 sq.m.)Carpet area: 1322 sq.ft. (122.82 sq.m.) | 3 | 4 | 3 | 22.0 | North-East | relatively new | 2350.0 | NaN | 1322.0 | 1 | 0 | 0 | 0 | 0 | 0 | 35 |
| 6 | ss the leaf | flat | sector 85 | 1.04 | 8320.0 | 1250.00 | Super Built up area 1640(152.36 sq.m.)Built Up area: 1550 sq.ft. (144 sq.m.)Carpet area: 1250 sq.ft. (116.13 sq.m.) | 2 | 2 | 3 | NaN | East | new property | 1640.0 | 1550.0 | 1250.0 | 0 | 0 | 0 | 0 | 1 | 0 | 116 |
| 7 | international city by sobha phase 1 | house | sector 109 | 5.90 | 24280.0 | 2429.98 | Plot area 270(225.75 sq.m.) | 4 | 5 | 2 | 2.0 | East | relatively new | NaN | 2430.0 | NaN | 1 | 0 | 0 | 0 | 0 | 2 | 90 |
| 8 | tulip violet | flat | sector 69 | 1.55 | 9822.0 | 1578.09 | Super Built up area 1578(146.6 sq.m.) | 3 | 3 | 2 | NaN | North-East | relatively new | 1578.0 | NaN | NaN | 0 | 1 | 0 | 0 | 0 | 2 | 127 |
| 9 | ats kocoon | flat | sector 109 | 2.95 | 10350.0 | 2850.24 | Built Up area: 3150 (292.64 sq.m.)Carpet area: 2850 sq.ft. (264.77 sq.m.) | 4 | 4 | 3+ | NaN | South-East | relatively new | NaN | 3150.0 | 2850.0 | 1 | 0 | 1 | 0 | 0 | 2 | 120 |
| society | property_type | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | servant room | pooja room | store room | study room | others | furniture_labels | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3792 | smart world orchard | flat | sector 61 | 1.65 | 13750.0 | 1200.00 | Carpet area: 1200 (111.48 sq.m.) | 2 | 2 | 2 | NaN | NaN | under construction | NaN | NaN | 1200.00 | 0 | 0 | 0 | 1 | 0 | 0 | 8 |
| 3793 | smart world orchard | flat | sector 61 | 1.60 | 13913.0 | 1150.00 | Carpet area: 1150 (106.84 sq.m.) | 2 | 2 | 2 | NaN | North-East | new property | NaN | NaN | 1150.00 | 0 | 0 | 0 | 1 | 1 | 0 | 20 |
| 3794 | mvn athens | flat | sohna road | 0.24 | 4210.0 | 570.07 | Super Built up area 570(52.95 sq.m.) | 2 | 2 | 1 | 12.0 | NaN | relatively new | 570.0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| 3795 | sare crescent parc | flat | sector 92 | 1.00 | 4778.0 | 2092.93 | Built Up area: 2093 (194.45 sq.m.) | 4 | 4 | 3 | NaN | NaN | Undefined | NaN | 2093.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3796 | vatika gurgaon | flat | sector 83 | 0.87 | 6987.0 | 1245.17 | Super Built up area 1245(115.66 sq.m.) | 2 | 2 | 2 | NaN | East | moderately new | 1245.0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| 3798 | emaar gurgaon greens | flat | sector 102 | 1.40 | 13690.0 | 1022.64 | Super Built up area 1650(153.29 sq.m.)Carpet area: 1022.58 sq.ft. (95 sq.m.) | 3 | 3 | 3 | NaN | East | relatively new | 1650.0 | NaN | 1022.58 | 1 | 0 | 0 | 0 | 0 | 0 | 52 |
| 3799 | dlf the arbour | flat | sector 63 | 8.50 | 21519.0 | 3950.00 | Built Up area: 3950 (366.97 sq.m.) | 4 | 4 | No | NaN | NaN | Undefined | NaN | 3950.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| 3800 | emaar mgf emerald floors premier | flat | sector 65 | 2.80 | 16969.0 | 1650.07 | Super Built up area 1975(183.48 sq.m.)Built Up area: 1800 sq.ft. (167.23 sq.m.)Carpet area: 1650 sq.ft. (153.29 sq.m.) | 4 | 4 | 3 | 2.0 | North | relatively new | 1975.0 | 1800.0 | 1650.00 | 0 | 0 | 0 | 1 | 0 | 2 | 67 |
| 3801 | signature global synera | flat | sector 81 | 0.48 | 8450.0 | 568.05 | Super Built up area 657(61.04 sq.m.)Carpet area: 568 sq.ft. (52.77 sq.m.) | 2 | 2 | 1 | NaN | South-East | relatively new | 657.0 | NaN | 568.00 | 0 | 0 | 0 | 0 | 0 | 2 | 63 |
| 3802 | independent | house | sector 41 | 11.00 | 33951.0 | 3239.96 | Plot area 360(301.01 sq.m.) | 4 | 4 | 2 | 3.0 | East | new property | NaN | 3240.0 | NaN | 1 | 1 | 0 | 1 | 0 | 0 | 21 |